Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems
نویسندگان
چکیده
This paper introduces an effective communication-induced checkpointing protocol using message logging to enable the number of extra checkpoints be far lower than previous number. Even if a situation occurs in which it is decided that process receiving has perform forced checkpointing, our allows skip action recognizes state its sender right before receipt recoverable. Additionally, thus not required assume piecewise deterministic model, despite being combined with logging. can maintain these features by piggybacking one-bit variable and n-size vector on each sent. Our simulation results verify claim presented performs much better representative optimized respect frequency, regardless communication pattern.
منابع مشابه
Recovery in Distributed Systems Using Optimistic Message Logging and Checkpointing
In a distributed system using message logging and checkpointing to provide fault tolerance there is always a unique maximum recoverable system state regardless of the message logging protocol used The proof of this relies on the observation that the set of system states that have occurred during any single execution of a system forms a lattice with the sets of consistent and recoverable system ...
متن کاملOptimistic Message Logging for Independent Checkpointing in Message-Passing Systems
Message-passing systems with communication protocol transparent to the applications typically require message logging to ensure consistency between checkpoints. This paper describes a periodic independent checkpointing scheme with optimistic logging to reduce performance degradation during normal execution while keeping the recovery cost acceptable. Both time and space overhead for message logg...
متن کاملAnti-message Logging Based Coordinated Checkpointing Protocol for Deterministic Mobile Computing Systems
A checkpoint algorithm for mobile computing systems needs to handle many new issues like: mobility, low bandwidth of wireless channels, lack of stable storage on mobile nodes, disconnections, limited battery power and high failure rate of mobile nodes. These issues make traditional checkpointing techniques unsuitable for such environments. Minimum-process coordinated checkpointing is an attract...
متن کاملNew Causal Message Logging Protocol with Asynchronous Checkpointing for Distributed Systems
Causal message logging is an efficient approach for tolerating failures of processes in distributed systems because it has the advantages of both pessimistic and optimistic message logging approach. However, traditional causal message logging protocols prevent live processes from executing continuously their computation and require some synchronous logging to the stable storage during recovery....
متن کاملDistributed System Fault Tolerance Using Message Logging and Checkpointing
Fault tolerance can allow processes executing in a computer system to survive failures within the system This thesis addresses the theory and practice of transparent fault tolerance methods using message logging and checkpointing in distributed systems A general model for reasoning about the behavior and correctness of these methods is developed and the design implementation and performance of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronics
سال: 2021
ISSN: ['2079-9292']
DOI: https://doi.org/10.3390/electronics10121428